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Clostridium difficil e focussed antibodies 

The present invention is concerned with antibodies either specific for and which confer 
inmmnity against infection by Clostridium difficile or antibodies produced after challenge 
5 Avith a C. difficile based vaccine, methods of identifying same, and uses of same. 

C difficile is a gram-positive anaerobic bacterium, and is deemed a significant human 
pathogen causing a spectrum of diseases ranging from mild diarrhoea to fulminant 
pseudomembranous colitis (PMC) - collectively referred to as C difficile antibiotic- 
10 associated diarrhoea (CDAD). CDAD is a common, iatrogenic, nosoconual disease 
associated with substantial morbidity and mortality, especially in the elderly. Two factors 
have been assigned main roles in the pathogenesis of CDAD - the suppression of the 
resident intestinal flora by the administration of antibiotics, and the production by the 
bacterium of two high molecular weight toxins, exotoxin A and exotoxin B. 

15 

The bacterium is endemic in hospitals, and studies have shown that approximately one 
third of patients receiving antibiotic treatment in acute-care medical wards were colonised 
by C difficile while in hospital (Kyne, L., et al, 2002, Clin. Infect. Dis. 34(3), pp346-53, 
PMID: 1 1774082). Of these patients, over half went on to develop CDAD while the 

20 remainder were symptomless carriers. CDAD is a major factor in extension of patient 
hospital stay times, and estmiates suggest that the cost of this disease in the US exceeds 
$1.1 billion per year ^yne, L., et al.. Supra). Patients suffering from CDAD respond well 
to a treatment which includes a discontinuation of the inciting antibiotic and treatment with 
either of the antibiotics metronidazole and vancomycin. However, the use of e.g. 

25 vancomycin is one of last resort since it is associated with several problems. Not only may 
it cause nephrotoxicity, ototoxicity, bone marrow toxicity and the red man syndrome, but 
the problem with this treatment regime is that the CDAD often returns after successftil 
treatment of the initial episode, and this reoccurrence represents a serious clinical problem. 
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Additionally, there is evidence that C. difficile is becoming resistant to metronidazole and 
partially resistant to vancomycin, demonstrating the need for new altematives in the 
treatment of CDAD. 

5 Exotoxins A and B which are produced by pathogenic strains of the bacterium are 
cytotoxic, enterotoxic and proinflammatory, and are considered to be the main viralence 
factors of this non-invasive microorganism. However, not all infections with toxigenic 
strains result in disease, prompting the search for additional vimlence factors. Bacterial 
surface expressed antigens represent candidate virulence factors, and are also considered 

10 important since such proteins likely mediate the essential functions such as adhesion to Hie 
epithelial layer of the gut in the first step of colonization or interaction with mediators of 
local immxmity. In common with many other bacteria, C difficile expresses a crystalline 
or paraciystalline surface layer (S-layer) on the outer cell surface. Such S-layers comprise 
proteins or glycoproteins forming a regularly arranged lattice on the external surfece of the 

15 bacterium, and have previously been shown to be essential for the vimlence of pathogens 
such as Aeromanas salmonicida and Campylobacter fetus. In contrast to most bacteria 
which comprise one S-layer, C. difficile is known to comprise two superimposed 
paracrystalline S-layers, each composed of a glycoprotein subunit which varies slightly in 
apparent molecular weight among different C difficile strains. Most strains of C. difficile 

20 express two major S-layer proteins (SLPs), one of 32-38 kDa (low-MW SLP) and a second 
of 42-48 kDa (high-MW SLP). The low-MW SLP appears to be immunodominant and is 
the antigen most commonly recognised by patients suffering from CDAD, and is the only 
antigen recognised in EDTA extracts of bacteria by antisera raised in rabbits against whole 
C difficile cells (Calabi, E. et al., 2001, MoL Microbiol., 40(5) pll87-99, PMID: 

25 11401722). 

During the course of microbial infection various adaptive strategies are employed by the 
immune system. One such strategy, and arguably the most important, is the production of 



# 

an antibody response. Antibodies capable of binding antigens displayed by the infectious 
agent are produced and bind to and allow killing of the microorganism through 
complement activation, recruitment of macrophage and through direct interaction with the 
microbe itself. The therapeutic efficacy of antibodies capable of binding a given antigen 
5 varies and this is reflected in the fact that antibody production by the immune system 
matures during the course of an infection and becomes focussed in the case of a patient 
successfully fighting off an infection. 

The antibody response is elicited by the B cell repertoire where individual B cells each 
10 produce structurally diverse antibody molecules. The actual size of this B cell/antibody 
repertoire is unknown, but it is estimated that the random clonal jfrequency of reactivity for 
a given antigen may be as high as 1 in 100,000 in cultured B cells (Nobrega, A., et al., Eur 
J Immunol. 1998 Apr;28(4): 1204-1 5; PMID: 9565360). During the course of infection, 
antibodies capable of binding the pathogen are selected for by changes in the B cell 
IS population resulting in key antibodies being produced in large numbers. The mechanisms 
for these changes include clonal expansion, isotype switching, and somatic mutation of 
immunoglobulin variable regions. B cells responsible for generating antibodies which are 
able to bind a pathogen multiply, thus skewing the B cell repertoire and changing the 
proportions of B cells. 

20 

Non-antibiotic based therapeutic regimes for the treatment/prevention of C« difficile 
infection are based upon vaccination and passive immunization. Vaccination treatment 
comprises administering to a patient either a nucleic acid sequence encoding an 
immunogenic fragment of the C. difficile surface layer protein or a variant or homologue 
25 thereof, or an equivalent polypeptide fragment (as disclosed in WO 02/062379). Passive 
immunotherapy is typically achieved by administering to a patient a monoclonal antibody 
specific to an immunogen produced by a pathogen. In general, passive immunotherapy is 
particularly effective in treating immunocompromised patients who are unable to respond 



to vaccination, and to patients who need immediate therapy and cannot wait for 
vaccination to take effect. In the case of a C. difficile infection, passive inmiimization 
relies on the administration to a patient of toxin-neutralizing polyclonal immune globulin, 
(as disclosed in WO 99/2030.4), or antibodies raised against the whole bacterium and the 
toxins (as disclosed in WO 96/07430). 

Thus, as can be seen from the prior art, in order to effect treatment of the patient, an 
immunogen is first isolated and purified, administered to test animals, and cells producing 
antibody specific against the nnmunogen cloned. The range of antibodies produced by the 
clones can th«i be tested for their therapeutic eflScacy and a single monoclonal antibody 
selected which is then administered to patients to effect passive immunotherapy. 

There are, however, several problems associated with current passive immunotherapy 
regimes aimed at treating C. difficile infections. For example, passive immunotherapy 
requires that there are survivors of the C. difficile infection and patients who have been 
vaccinated. Each batch of antibody can be different leading to difficulties associated with 
standardisation and administration of the immunotherapeutic reagent In addition, the 
problem of inadvertent administration to a patient of adventitious agents (e.g HIV, HBV, 
HCV, or as yet unidentified agents) is a real one, and up-to-date screening of any 
immunotherapeutic reagent is required. Finally, the strain variability exhibited by C. 
difficile means that a given antibody may only be useful against certain strains of the 
bacterium and not against others. 

Obviously the techniques involved are somewhat complex, inconveni^t, expensive and 
time consuming. In general they require that an immunogen is isolated from the infecting 
pathogen and used to generate antibody. Simply isolating the immunogen can be extremely 
difficult and time consuming, particularly if it comprises carbohydrate or complex non- 
linear epitopes (i.e. epitopes having secondary, tertiary and/or quaternary structural 



features) which cannot be synthesised in vitro, and are impossible to isolate and produce 
as an antigen for use as e.g. a vaccine. The SLPs of C. difficile contain a glycoprotein 
subunit which varies in molecular weight between species. It may be the case that C 
difficile epitopes are only produced in vivo and are not synfliesised in vitro ^ consistent with 
for example Neisseria gonorrhoeae infections (the causal agent of gonorrhea) where 
specific antigens are only expressed upon infection of a host Such antigens fall into the 
general class of cryptoantigens. Furthermore, C difficile may display highly labile antigens 
which are difficult to work with since during use they simply degrade and the epitope they 
display is lost. With regard to this range of epitopes/antigens whose identification and/or 
in vitro use is of great difficulty or impossible, the present invention overcomes these 
disadvantages and provides a solution by providing antibodies whose CDR regions have 
been generated in response to C. difficile epitopes during antibody responses of patients 
infected with C difficile. 

In addition, the prior art typically has to attempt to achieve an equivalent of affinity 
maturation of antibodies by first synthesising a set of candidate antibodies specific to an 
antigen, testing them for their binding characteristics and then modifying the sequences 
of the candidates in order to optimise the binding. A thorough attempt at affinity 
maturation (i.e. optimising antibody binding) can require the synthesis of thousands of 
different antibodies, which can be costly and time-consuming. Because the antibodies of 
the present invention are obtained firom patients who have either been infected by a 
pathogen displaying the antigen or who have been vaccinated with an antigen, they have 
by their very nature and definition akeady imdergone affinity maturation, as is most clearly 
demonstrated by the sequences of their CDR3 regions of the variable heavy and variable 
light chains (i.e. the CDR-H3 and CDR-L3 regions). 
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According to the present invention tiiere is provided an antibody or an antigen binding 
fragment thereof having the CDR-H3 sequence selected &om the group consisting of: SEQ 
ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, and SEQ ID NO: 31. 

5 Also provided according to the present invention is an antibody or an antigen binding 
fragment thereof having the CDR-L3 sequence selected from the group consisting of: SEQ 
ID NO: 32, SEQ ID NO: 33, and SEQ ID NO: 34. 

Also provided according to the present invention is an antibody or an antigen binding 
10 fragment thereof having a CDR-H3 sequence selected from the group consisting of: SEQ 
ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, and SEQ ID NO: 31, and 
a CDR-L3 sequence selected from the group consisting of: SEQ ID NO: 32, SEQ ID NO: 
33. and SEQ ID NO: 34. 

15 According to tihie present invention there is also provided a melliod for identifying 
candidate sequences of at least the C3DR3 region of antibodies specific against at least one 
antigen produced by C. difficiU during an infection or against a vaccine, comprising the 
steps of: 

(i) wifli B cells isolated from at least one patient who has been infected 
20 by Clostridium difficile or administered said vaccine, sequencing at 

least the CaDR3 region of the VH and/or VL coding regions of said B 
cells; and 

(ii) correlating said sequenced at least the CDR3 regions of the VH and/or 
VL coding regions of said B cells from said at least one patient to 

25 identify a set of candidate sequences for at least a CDR3 region of 

antibodies specific against said at least one antigen produced by 
Clostridium difficile or against said vaccine, each of said set of 
candidate CDR3 sequences or a sequence having at least 80% 



homology therewith occurring in total at a frequency of at least 1 
percent in the set of sequences determined at step (i). 

Examples of patients used as a source of B cells in such a method are himians and other 
mammals such as mice, rahhits, rats, baboons, monkeys and apes. 

In certain embodiments of the invention, step (i) may comprise the steps of: 

(i)(a) isolating B cells from at least one patient who has been infected by 

Clostridium difficile or administered said vaccine; and 
(i)(b) sequencing at least the CDR3 region of the VH and/or VL coding 
regions of said B cells; 

The sequences occurring in total at a frequency of at least 1 percent in the set of sequences 
determined at step (i) may be the candidate CDR3 sequences or a sequence having at least 
80% homology, for example 85, 90, 95, 96, 97, 98 or 99% homology therewith. Sequence 
homology is as determined using the BLAST2 program (Tatusova TA et aL, FEMS 
Microbiol Lett. 1999 May 15;174(2):247-50; PMID: 10339815) at the National Center for 
Biotechnology Information, USA (www.ncbi.nlm.nih,gov) with default parameters. 

For example, the sequences occurring in total at a frequency of at least 1 percent in the set 
of sequences determined at step (i) may be the candidate CDR3 sequences or a sequence 
having 1 or 2 amino acid changes therefrom. In general, the frequency of a dominant 
sequence is added together with the frequency of any sequence showing >80% homology, 
such that any sequence exhibiting a frequency of 1 % or greater is then deemed a candidate 
sequence. For example, a first sequence may occur at a frequency of 0.7 percent, and first, 
second, third and fourth sequences each having a single amino acid change there from each 
occur at a frequency of 0.1% - the total occurrence is therefore 1.1% and the dominant 



antibody sequence (occurring at a frequency of 0.7%) is therefore a candidate CDR3 
sequence. 



This is fundamentally different to prior art methods of making a library of antibodies in a 
vector such as aphage library where antibodies are isolated by panning and which require 
bmding witii a specific (known) antigen in a structurally- and chaige-specific state. An 
analogy by which the two can be compared would be to say that the prior art making of a 
library and subsequent pannmg is like going into a library, picking up a book and hoping 
that it is the right one. In comparison, the present invention is like going into a library, 
reading all of the books in the Ubrary and finding that one book is dominant - that there are 
many copies of it - and that it is relevant to the disease to be treated. 

In addition, the prior art use of library systems has a number of significant problems - in 
particular, some antibodies produced by a library may cause the death of the organism 
expressing them and therefore they simply cannot be detected. This is not a particular 
problem when e.g. looking for antibodies specific to cancere, but when one is searching 
for antibodies specific to an antigen fiiom a pathogen which might be homologous to one 
produced by the host expression system (e.g. Escherichia coif) then important antibodies 
cannot be expressed. The use of e.g. E. coli to e3q)ress Ubraries of e.g. human antibodies 
also suffers from the problem of codon usage - codons used by humans for specific amino 
acids can frequently not be the optimum ones for the same amino acid in E. coli or other 
host systems. This means that an important antibody might not be expressed (or at least not 
in sufficient quantities) since the codons in its sequence are highly inefficient in E. coli, 
resulting in the E. coli being unable to read through and express it in fuU. Codon 
optimisation of antibody libraries is obviously not an option since the libraries would first 
have to be sequenced, which defeats the main advantages of usmg libraries. Since the 
present invention sequences antibodies directly firom a patient, it avoids this problem. 
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The correlation step (ii) may also correlate the occurrence of the candidate sequences 
against their occurrence in patients who have not been infected with Clostridium difficile 
or administered the vaccine, sequences only being determined to be candidate sequences 
if they or a sequence having 80% homology to them occur in total at a frequency of less 
5 than 1 percent in the non-infected/vaccinated patients. 

The at least one antigen produced by the micro-organism may of course be an immunogeiL 

The present invention provides unique opportunities for vinderstanding antibody responses 
10 . to infection which were not previously available, and provides novel diagnostic and 
therapeutic opportunities. 

In particular, since the methods of the present invention bypass the need to isolate an 
antigen from C. difficile, the difficulties associated with identifying e.g. complex non- 
15 linear epitopes, epitopes containing carbohydrate, or cryptoantigens are avoided. Instead 
the methods facilitate the determination of the identity of antibodies specific against one 
or more antigens produced by the bacterium. As is explained below, the methods of the 
present invention allow the identification of therapeutically effective antibodies and can 
allow the identification of the most important parts of antibody sequences. This is 
20 particularly evident when the B cells of a given patient are sampled at different time points 
during the course of an infection by C. difficile - as the patient's immune system becomes 
focussed on producing therapeutically effective antibodies against the bacterium, a 
focussing of antibody sequences is observed, and the development of variable and non- 
variable regions within the CDR parts of the VH and/or VL sequences is observed. In 
25 addition, the methods of the present invention can allow the identification of antibodies 
best suited to the treatment of C difficile infection in particular groups of patients, such 
as age, sex and racial groups. Similarly by comparing the antibody sequences of patients 
who have recovered from infection with C. difficile with those that have not recovered 
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from infection, ineffective antibody sequences (which might be produced by both sets of 
patients) can be identified. 

The B cells isolated from the at least one patient can be peripheral B-cell lymphocytes 
(PBLs), and can be isolated from a blood sample from the at least one patient. B-cells can 
also be isolated from other sources and the present invention extends to their use. For 
example, B-cells can be isolated from spleen. 

Antibody CDR sequences are determined as detailed in the "Experiments" section using 
standard techniques and definitions of their start and end. 

As is detailed below, ftxe antibody sequences are determined from patients with C difficile 
infections. These sequences can be derived from B ceU immunoglobulin mRNA and hence 
reflect expressed antibodies and repertoires therem. The sequences determined according 
to the present invention need not be the sequences of whole antibody molecules - they 
comprise at least the VH and/or VL sequences which specify flie regions responsible for 
antigen binding. 

From the nucleotide sequences determined by the initial sequencing, putative amino acid 
sequences for the VH and/or VL regions can be determined using standard algorithms and 
software packages (e.g. see www.mrc-hnb.cam.ac.uk/pubseq/, the Staden package and 
Gap4 programs). These can be fijrther characterised to determine the CDR 
(Complementarity Determining Region) parts of the VH and/or VL sequences, particularly 
CDRl, CDR2 and CDR3. Methods for determining the putative amino acid sequences and 
identifying CDR regions are well known and detailed below. 

As well as the VH and/or VL amino acid sequences, several pieces of additional 
information can be used in the correlation step: 



(i) the strain of C. difficile causing the patient's infection; 

(ii) the time point during the infection process at which antibody 
sequences were sampled 

(iii) patient details - none or more of: sex, race, and age; and 

(iv) the range of complementarity determining regions (CDR) ie the 
variable regions of the antibody that undergo direct antigen 
binding/contact. 

Thus in a metbod according to the present invention, B cells can be used which have been 
isolated from the at least one patient at a plurality of time points during infection of the at 
least one patient by C. difficile (or post-vaccination), correlation step (ii) correlating the 
time point during infection of the at least one patient by C. difficile at which the B cells are 
isolated. 

Similarly, in a method according to the present invention, B cells can be used which have 
been isolated from at least two patients, at least one of whom has recovered from infection 
by C difficile^ and at least one of whom has not recovered fiiom infection by C difficile, 
correlation step (ii) correlating the recovery of the at least two patients from infection by 
C. difficile. 

Similarly, in a metidod according to the present invention, B cells can be used which have 
been isolated from at least two patients, the patients being infected by strains of C difficile 
producing the at least one antigen, correlation step (ii) correlating the sequenced at least 
the VH and/or VL coding regions of the B cells to identify a set of candidate sequences for 
antibodies, each of which is specific against at least one shared antigen produced by the 
diffo-ent strains of C. difficile or is specific against different antigens produced by the 
different strains of C. difficile. 
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The sequencing of the VH and/or VL regions can also be used to identify the surrounding 
antibody framework, and this can be used to determine the antibody isotype. This can then 
be used in the correlation step to determine whether specific antibody isofypes are 
particularly usefiil. 

5 

By the term "therapy" is meant any treatment which is designed to cure, alleviate, remove 
or lessen the symptoms of, or prevent or reduce the possibility of contracting any disorder 
or malfunction of the human or animal body 

10 The invention is suitable for identifying antibody sequence useful in treating C. difficile 
infection. 

Using the abovementioned additional information fields in the correlation step (ii), it is 
easy to perform the correlation using e.g. only CDR sequences, CDRs from eiflier a given 
15 patient or a given infection or both, CDRs from a given site or patient group, or CDRs 
from different sampling times over the course of infection. 

These correlations are CDR-q>ecific because these are the hypervariable regions 
responsible for antigen binding and sequence diversity of the antiboify repertoire (although 
20 the framework regions can be used in the correlation step as well to reveal the antibody 
isotype). 

The present invention is useful in a wide range of applications, e.g. antibody therapy, and 
vaccine studies. 

25 

Antibody Therapy: 

This involves the passive transfer of antibody to non-immune individuals (e.g. patients 
undergoing chemo/radio therapy, inmnmosuppression for organ transplantation, 
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iiminmocomproinised due to underlying conditions such as diabetes, trauma etc, also the 
very young or very old). 

The present invention can be used to deteraiine the sequences of antibodies conferring 
5 immunity by looking for over-represented VH and/or VL sequences in patients who have 
overcome infection. These protective antibodies can be re-synthesised at the genetic level, 
over-expressed in E. coli (or other expression systen[is) and purified. The resultant purified 
recombinant antibody can then be administered to patients as a passive immunotherapy. 
Antibodies can also be ordered fi"om commercial suppliers such as Operon Technologies 
10 Inc., USA (www.operon.com) by simply supplying them with the seiquence of the antibody 
to be manufactured. 

Therapeutically useful sequences can be identified in a number of ways which can be used 
in the correlation step (ii): 



15 



1) 



By looking for antibody sequence over-representation in patients who 
have recovered ftom C. difficile infection - B cells that produce 
antibodies capable of pathogen binding undergo clonal expansion, 
hence high frequency antibody sequences are most likely to bind 
pathogen and confer immunity. 



20 



25 



2) 



By following alterations in the antibody repertoire over the course of 
C difficile infection. During infection, antibodies undergo a 
maturation process to improve pathogen binding and this is 
characterised by sequence alterations. Also, B cell clonal expansion 
is more prevalent in the final stages of infection where the infection 
is cleared. The most frequent antibodies in the repertoire are chosen 
as candidates for immunotherapy. Following the maturation process 
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the candidate antibody will demonstrate which key amino acid residue 
alterations improve antigen binding and this information can be used 
to improve antibody design. 

3) Analysis of the repertoires from different patients with C difficile 
infection can identify shared and identical protective antibodies. 
These antibodies are attractive choices for immunotherapy as their 
occurrence in different patients suggests a strong positive selection in 
their favour; hence an important role in antibody-based immimity. 

4) Analysis of the repertoires from patients infected with different strains 
of C. difficile. In this situation, if a common antibody sequence is 
found in the repertoires for both strains, the antibody may be useftil 
in treating both strains of the bacterium, ie displaying a broad 
spectrum. 

5) Analysis of the repertoires for afiSnity maturation of sequences. 
Vaccination Studies: 

Vaccination protects against infection by priming the immune system with 
pathogen-derived antigen(s). Vaccination is effected by a single or repeated e)q)osures to 
the pathogen-derived antigen(s) and allows antibody maturation and B cell clonal 
expansion without the deleterious effects of the full-blown infectious process. T cell 
involvement is also of great importance in effecting vaccination of patients. The present 
invention can be used to monitor the immunisation process with experimental C. difficile 
vaccines. Subjects are given the experimental vaccine and VH and/or VL sequences are 
amplified from the patient and the antibody repertoire analysed as described above. 
Qualitative and quantitative assessment of the vaccination process is possible: 
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1) The VHAn:. repertoire of a group of patients who have been 
administered an experimental C. difficile vaccine is assessed. A 
precise molecular dissection of the resulting antibody response to the 
vaccine can be performed to determine (a) clonal expansion of 
protective antibodies, (b) protective antibody production in different 
populations (for example differing ethnic groups with different 
genetic backgrounds, as well as e.g. age and sex groups), and (c) the 
long term effect of the vaccine i.e. the antibody response over the long 
term - long term antibody memory in the immune system as well as 
any autoimmune defects; 

2) Where vaccination results in an increased frequency of a given 
antibody, the antibody sequence can easily be cloned and expressed 
and used in animal models of infection. If the antibody is protective, 
it may be useful in itself as an immunotherapy, but this also 
demonstrates that the vaccine is likely to be protective without 
subjecting the human subject to experimental infection. 

Also provided to the present invention is a method of manufacture of a medicament for the 
treatment of a C. difficile infection which produces at least one antigen, comprising the 
steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least one antigen produced 
by C. difficile; and 

(ii) synthesising at least one antibody comprising a said candidate 
sequence specific against the at least one antigen produced by C. difficile. 
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Medicaments and methods of treatment according to the present invention will be readily 
apparent to one skilled in tiie art. Medicaments may be prepared using phamiaceutically 
acceptable carriers, diluents or excipients (Remington's: The Science and Practice of 
Phamiacy (1995)Mack Publishing Company, Easton, PA, USA). The medicaments and 
methods of treatment may be effected using a pharmaceutically effective amount of the 
antibody/antigen-binding fragment Appropriate dosages will be readily apparent to one 
skilled in the art and may be readily determined, for example by means of dose-response 
experiments 

Also provided according to the present invention is a method of treatment of an infection 
of a patient by C difficile which produces at least one antigen, comprising the steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least one antigen produced 
by C. difficile^ 

(ii) synthesising at least one antibody comprising a said candidate 
sequence specific against the at least one antigen produced by C. difficile^ and 

(iii) administering a therapeutically efifective quantity of said at least one 
synthesised antibody to said patient. 

Also provided according to the present invention is a method of producing a database 
which identifies candidate sequences for antibodies specific against at least one antigen 
produced by C difficile^ comprising ttie steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least one antigen produced 
by C difficile^ and 

(ii) storing the data produced by said method in said database. 
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Also provided is a method of generating a report which identifies candidate sequences for 
antibodies specific against at least one antigen produced by C. difficile^ comprising fiie 
steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least one antigen produced 
by C. difficile; and 

(ii) producing a report comprising the data produced by said method of 

step (i). 

Also provided according to the present invention is a method for determining the eflBcacy 
of a vaccine, comprising tiie steps of: 

(i) with B cells isolated firom at least one patient who has been 
administered said vaccine, sequencing at least the CDR3 region of the 
VH and/or VL coding regions of said B cells; and 

(ii) correlating said sequenced at least the CDR3 region of the VH and/or 
VL coding regions of said B cells to identify a set of candidate 
sequences for at least the CDR3 region of antibodies specific against 
said vaccine, each of said set of CDR3 candidate sequences or a 
sequence having at least 80% homology therewith occurring in total 
at a frequency of at least 1 percent in the set of sequences determined 
at step (i). 

In certain embodiments of the invention, step (i) above may comprise the steps of: 
(i)(a) administering said vaccine to at least one patient; 
(i)(b) isolating B cells firom said at least one patient; and 
(i)(c) sequencing at least the CDR3 region of the VH and/or VL coding 
regions of said B cells. 
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Correlation step (ii) may comprise correlating said sequenced at least a CDR3 region of 
the VH and/or VL coding regions of said B cells with sequenced at least a CDR3 region 
of the VH and/or VL coding regions of B cells isolated from at least one patient who has 
been infected with C difficile against which vaccination with said vaccine is intended to 



As described above, additional information may be used in the correlation of step (ii). 



The antibody sequences determined from patients who have been administered the vaccine 
can be compared with antibody sequences isolated from patients who have been infected 
with a difficile against which the vaccine is intended to stimulate a protective immune 
response in patients. Thus it is possible to determine whether the vaccine results in the 
generation of antibodies which are also produced in response to infection by C. difficile 
itself. In particular, the comparison can be made using antibody sequences detennined 
from patients who have recovered from infection by C. difficile. 

In particular the method may determine the efficacy of the vaccine in stimulating a 
protective immune response against C difficile against which vaccination witli the vaccine 
is intended to stimulate a protective immune response. 



stimulate a protective immune response. 



including: 



(i) 

(ii) 

(iii) 



the time since administration of the vaccine to the patient; 

patient details - none or more of: sex, race, and age; and 

the range of complementarity determining regions (CDR) ie the 

variable regions of the antibody that undergo direct antigen 

binding/contact. 



Also provided is a method of producing a database which identifies the efficacy of 
vaccine, comprising the steps of: 
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(i) performing a mefhod according to the preset invention to determine 
the efficacy of said vaccine; and 

(ii) storing the data produced by said method in said database. 

Also provided according to the present invention is a method of generating a report which 
identifies the efficacy of a vaccine, comprising the steps of: 

(i) performing a method according to the present invention to determine 
the efBcacy of said vaccine; and 

(ii) producing a report comprising the data produced by said method. 

Also provided according to the present invention is a diagnostic test method for identifying 
a Clostridium difficile infection in a patient, comprising the steps of: 

(i) with B cells isolated firom said patient, sequencing at least the CDR3 
region of the VH and/or VL coding regions of said B cells; 

(ii) comparing said sequenced at least said CDR3 region of the VH and/or 
VL coding regions of said B cells with a set of sequences for at least 
the CDR3 region of antibodies specific against Clostridium difficile^ 
and determining whether each of said set of CDR3 sequences or a 
sequence having at least 80% homology therewith occurs in total at 
a frequency of at least 1 percent in the set of sequences determined at 
step (i); and 

(iii) correlating the results of comparison step (ii) to determine the 
presence or absence of a Clostridium difficile infection in said patient 

Also provided according to the present invention is a diagnostic test method for 
determining the susceptibiUty of a patient to Clostridium difficile infection, comprising the 
steps of: 
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(i) with B cells isolated from said patient, sequencing at least the CDR3 
region of the VH and/or VL coding regions of said B cells; 

(ii) comparing said sequenced at least said CDR3 region of the VH and/or 
VL coding regions of said B cells with a set of sequences for at least 
the CDR3 region of antibodies specific against Clostridium difficile, 
and determining whether each of said set of CDR3 sequences or a 
sequence having at least 80% homology therewith occurs m total at 
a frequency of at least 1 percent in the set of sequences determined at 
step (i); and 

(iii) correlating the results of comparison step (ii) to deteimine the 
susceptibility of said patient to Clostridium difficile mfection. 

The diagnostic test method could be used to sequence the CDR3 sequences of patients in 
for example a hospital ward, to detennine which patients were susceptible to infection by 
the bacterium. The presence of C. difficile specific antibodies in a sample (as detemiined 
by sequencing of CX)R3 regions) would likely indicate that such patients were capable of 
mounting a protective immune response toward the bacterium and would not require 
antibiotic treatment. In contrast, those patients identified who had not produced C. difficile 
specific antibodies could be identified as being immunocompromised and therefore 
candidates for antibiotic treatment during the course of a C. difficile infection. 

Also provided according to the present invention is a diagnostic test kit for performing a 
diagnostic test method according to the present mvention. The kit may contain reagents 
(e.g. PGR primers) which would enable a person skilled in the art to identify CDR3 
sequences from a patient. 



regards antibodies usable in the present invention, particularly synthetic antibodies (the 
m "antibody" is also considered to incorporate antigen binding fragments of antibodies 
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unless the context otherwise requires), the antibody may be a whole antibody or an antigen 
binding fragment thereof and may in general belong to any immunoglobulin class. Thus, 
for example, it may be an IgM, IgG or an IgA antibody. The antibody or fragment may be 
of animal, for example, mammalian origin and may be for example of murine, rat, sheep 
or human origin. It may be a natural antibody or a fragment thereof, or, if desired, a 
recombinant antibody fragment; i.e. an antibody or antibo(fy fragment which has been 
produced using recombinant DNA techniques. 

Particular recombinant antibodies or antibody fragments include, (1) those having an 
antigen binding site at least part of which is derived from a different antibody, for exanq>le 
those in which the hypervariable or complementarity detmnining regions of one antibody 
have been grafted into the variable framework regions of a second, different antibody (as 
described in, for example, EP 239400); (2) recombinant antibodies or fragments wherein 
non-Fv sequences have been substituted by non-Fv sequences from other, different 
antibodies (as described in, for example, EP 171496, EP 173494 and EP 194276); or (3) 
recombinant antibodies or fragments possessing substantially the structure of a natural 
immunoglobulin but wherein the hinge region has a different number of cysteine residues 
from that foimd in the natural immunoglobulin but wherein one or more cysteine residues 
in a surface pocket of the recombinant antibody or fragment is in the place of another 
amino acid residue present in the natural inununoglobulin (as described in, for example, 
WO 89/01782 and WO 89/01974). 

Teachings of texts such as Harlow, E. and Lane, D. ("Using Antibodies: A Laboratory 
Manual", Cold Spring Harbor Laboratory Press, New York, 1998) fiirther details 
antibodies, antibody fragments, titieir preparation and use. 

The antibody or antibody fragment may be of polyclonal or monoclonal origin. It may be 
specific for at least one epitope. 
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Antig^ binding antibody fragments include, for example, fi:agments derived by proteolytic 
cleavage of a whole antibody, such as F(ab')2, Fab' or Fab fragments, or fragments 
obtained by recombinant DNA techniques, for example Fv fragments (as described, for 
example, in WO 89/02465). 

5 

Where it is desired to produce recombinant antibodies according to the invention tiiese may 
be produced using, for example, the methods described in EP 171469, BP 173494, BP 
194276 and EP 239400. Antibody fragments may be produced using conventional 
techniques, for example, by enzymatic digestion with pepsin or papain. 

10 

Antibodies according to the invention may be labelled with a detectable label or may be 
conjugated with an effector molecule, for example a drug eg. an antibacterial agent or a 
toxin or an enzyme, using conventional procedures and the invention extends to such 
labelled antibodies or antibody conjugates. 

15 

The contents of each of the references discussed herein, including the references cited 
therein, are herein incorporated by reference in their entirety. 

Where "PMID:" reference niunbers are given for pubUcations, these are the PubMed 
20 identification numbers allocated to them by the US National Library of Medicine, from 
which fiill bibliographic information and abstract for each publication is available at 
www.ncbi.nlm.nih.gov. 

The invention will be fiirther apparent from the following description, with reference to 
25 the several figures of the accompanying drawings, which show, by way of example only, 
forms of identifying candidate sequences for antibodies specific against at least one 
antigen and of determining the efficacy of a vaccine. 
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Of the Figures: 



Figure 1 shows the general principles for the isolation and DNA 
sequencing of VH and/or VL antibody gene fragments from B 
cells. Reference numeral 10 indicates primary PGR, refer^ce 
numeral 20 the cloning into a DNA sequencing vector using 
the T-tailing principle - random orientation, and reference 
number 30 the determining of the nucleotide sequence of 
forward and reverse strands using Ml 3 forward and reverse 
primers; and 

Figure 2 shows a schematic depiction of resyntihesised recombinant 
antibody gene cassette. VH and VL regions are linked with a 
glycine serine-rich linker. Each variable domain contains three 
Complementarity Determining Regions (CDRs) which 
participate in antigen binding. 
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EXPERIMENTS 

The experiments below detail methods for identifying antibody VH and/or VL CDR 
sequences that confer immiinity in patients with C. difficile infection. These sequences can 
then be used to produce synthetic antibodies and these synthetic antibodies are suitable for 
administration to patients for therapy. Also detailed are the VH and VL CDR3 sequences 
identified by the methods. 

Also detailed are methods of determming the efBcacy of a vaccine in generating a desired 
immune response in a patient. 

Unless stated otherwise, all procedures were performed using standard protocols and 
foUowmg manufacturer's instructions where applicable. Standard protocols for various 
techniques including PGR, molecular cloning, manipulation and sequencing, the 
manufacture of antibodies, epitope mapping and mimotope design, cell culturing and 
phage display, are described in texts such as McPherson, M.J. et al (1991, PGR: A 
practical approach, Oxford University Press, Oxford), Sambrook, J. et al (1989, Molecular 
cloning: a laboratory manual. Cold Spring Harbour Laboratory, New York), Huynh and 
Davies (1985, "DNA Cloning Vol I - A Practical Approach", IRL Press, Oxford, Ed. D.M. 
Glover), Sanger, F. et al. (1977, PNAS USA 74(12): 5463-5467), Harlow, E. and Lane, 
D. ("Using Antibodies: A Laboratory Manual", Cold Spring Harbor Laboratory Press, New 
York, 1998), Jung, G. and Beck-Sickinger, A.G. (1992, Angew. Chem. Int. Ed. Eng., 31: 
367-486), Harris, M.A. and Rae, I.F. ("General Techniques of Cell Culture", 1997, 
Cambridge University Press, ISBN 0521 573645), "Phage Display of Peptides and 
Proteins: A Laboratory Manual" (Eds. Kay, B.K., Winter, J., and McCafferty, J., Academic 
Press Inc., 1996, ISBN 0-12-402380-0). 
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Reagents and equipment useful in, amongst others, the methods detailed herein are 
available from the likes of Amersham (www.amershanLco.uk), Boehringer Mannheim 
(www.boehringer-ingelthemi.com), Clontech (www.clontech.com), Genosys 
(www.genosys.com), Millipore (www.millipore.com), Novagen (www.novagen.com), 
Perkin Ehner (www.perkinelmer.com), Pharmacia (www.phamiacia.com), Promega 
(www.promega.com), Qiagen (www.qiagen.com), Sigma (www.sigma-aldrich.com) and 
Stratagene (www.stratagene.com). 

The term "antibody" in its various grammatical forms is used herein to refer to 
immunoglobulin molecules and immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antibody combining site or paratope. Such 
molecules are also referred to as "antigen binding fragments" of immunoglobulin 
molecules. 

Illustrative antibody molecules are intact immunoglobulin molecules, substantiaUy intact 
immunoglobuUn molecules and those portions of an immunoglobulin molecule that contain 
the paratope, including those portions known m the art as Fab, Fab', F(ab')2, scFv and F(v). 

The term "antibody combining site" refers to that structural portion of an antibody 
molecule comprised of a heavy and light chain variable and hypervariable regions that 
specifically binds (immunoreacts with) antigen. 

The identification and sequencing of B cells producing antibodies and the analysis of the 
resulting data comprises the following basic steps: 

1) Isolation of VH and/or VL codmg regions from circulatory B cells of human patients. 

2) Determining the Nucleotide sequence of VH and/or VL repertoires. 

3) Determination of VH and/or VL primary amino acid sequences. 

4) Extraction of CDR regions in silico - incorporation into the database 
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5) Detection of dominant CDR & framework regions in VH and/or VL repertoire. 

6) Construction & Production of therapeutic recombinant antibodies from dominant 
antibody sequences. 

1. Isolation of VH and/or VL coding regions from circulatory B cells of human 
patients 

For this method, human patients with C difficile infection were selected as donors of 
immunised B cells. The criteria for selection were: 

(a) The patient must exhibit a pronounced antibody response to C. difficile, detectable for 
example by Western blotting. Samples of antibodies were collected from a patient blood 
sample. Samples were diluted and tested for immunoreactivity by Western blottuig using 
antigen(s) derived from C. difficile. In such an assay, patients exhibiting strong antibody 
responses showed pronounced recognition/antibody binding of C difficile antigens as 
detected using anti-human polyclonal detection antibodies. 

(b) The patient has survived the course of C, difficile infection - this improves the chance 
of generating B cell responses for antibodies capable of neutralizing the bacterium. 
Peripheral B-cell lymphocytes (PBLs) were collected from infected patient blood samples. 
For this heparinised blood was diluted in PBS (20 ml total) and overlaid onto a 15 ml 
cushion of Ficol Hypaque (Phannacia; unless stated otherwise, all chemicals and culture 
media were purchased from Sigma, UK) in a 30 ml centrifiige tube. The PBLs were then 
collected by centrifogation (400 x 5 minutes) and washed in PBS and harvested by 
centrifugation again. RNA was prepared from PBLs using the QuickPrep mRNA 
purification kit (Phannacia) exactly according to the manufacturers instractions. The 
isolated mKNA was used to prepare cDNA via a reverse transcriptase reaction (Promega 
cDNA synthesis kit). For this 2ng of mKNA was re-suspended m 16 ^iL nuclease-free 
water and heated to 65 °C for 3 minutes (to denature secondary structure) and then 
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iramediately chilled on ice for 1 minute. The mRNA was then added to the following 
cocktail: 8 25 mM MgCl2, 4 fiL dNTP mix (1 0 mM with respect to each ribonucleotide 
triphosphate), 1 jiL RNAsin 40 u jiL"^ stock solution, 1.2 \iL AMV reverse transcriptase 
(25 u ^L'* stock solution), 6 fxL cDNA 10 pmol ^iL* primer (see Figure 1 - cDNA 
S synthesis). The mixture was incubated at 42 °C for 1 hour and then incubated at 100 ^'C 
for 3 minutes to. stop the reaction. 

DNA coding for antigen binding regions was then amplified by PGR using the cDNA 
prepared from patients PBLs. For this, the cDNA was used in the following PGR reaction 

10 to produce either heavy chain or light chain-derived antibody variable region DNA (see 
Figure 1): 2.5 jliL cDNA, 33 p-L water, 4 \iL dNTP mix (25 mM with respect to each 
deoxynucleotide triphosphate), 5 jiL Taq reaction buffer (Perkin Elmer), 2.5 pL of an 
equimolar primer mix (final concentration of 20 pmol with respect to each primer) and 0.5 
pL Taq DNA polymerase ( 1 u pL '\ Perkin Elmer Corp). The forward and reverse primers 

15 used in these reactions are depicted in Figure 1, and their respective nucleotide sequences 
are listed in Table 1 • PGR reaction conditions were 94 **G for 1 minute, 57 "^C for 1 minute 
and 72 ^'G for 2 minutes for a total of 30 cycles, with an extended denaturation (94 **G for 
5 minutes) prior to cycle 1 and an additional extension step after the end of cycle 30 (72 
°G for 10 minutes). PGR was performed using a Perkin Elmer 9700 GeneAmp PGR 

20 machine. 5 nL of the PGR reaction was run on a 1% agarose gel to check the ampUfication 
of the expected 393 base pair (bp) product. The remaining product was run on a 0.8% low 
melting point (LMP) agarose gel and the 393 bp band was excised using a clean scalpel 
blade. DNA was extracted firom the agarose gel slice using a GeneGlean n Kit (Anachem, 
Luton, UK). 

25 

The PGR resulted in two fragment types -derived from VH and VL regions respectively, 
depending on the PGR primer sets used (for details of primers sets for antibody genes and 
a PGR schematic see Table 1 and Figure 1, respectively). 
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Table 1 : Primers used in generating and sequencing VH and VL gene fragments. AU 
sequences are given in the 5' to 3' direction. Use of primers is described above and in 
Figure 1. 



Table 1 



Primers used in cDNA svnthesis reantir^^ s 

HuIgGl-4P0R 6TC CAC CTT GGT GTT GCT GGG CTT 

HuCLFOR AGA CTC TCC GCT GTT GAA GCT CTT 



SEQIDNO: 
01 
02 



Primers for primary VHR 



HUJH1-2FOR 
HUJH3FOR 
HtUH4-5FOR 
HUJH6FOR 



TGA 
TGA 
TGA 
TGA 



GGA 
AGA 
GGA 
GGA 



GAC 
GAC 
GAC 
GAC 



GGT 
GGT 
GGT 
GGT 



GAC 
GAC 
GAC 
GAC 



HuVHlaBACK CAG GTG CAG 

HuVH2bBACK CAG GTG AAC 

HuVHSaBACK CAG GTG CAG 

HuVH4aBACK CAG GTG CAG 

HuVH4bBACK CAG GTG CAG 

HuVHSaBACK GAG GTG CAG 

HuVH6aBACK CAG GTA CAG 



CAG 
CAT 
CAG 
CGT 



GGT 
TGT 
GGT 
GGT 



GCC 
CCC 
TCC 
CCC 



CTG 
TTA 
CTG 
CTG 
CTA 
CTG 
CTG 



GTG 
AGG 
GTG 
CAG 
CAG 
TTG 
CAG 



CAG 
GAG 
GAG 
GAG 
CAG 
CAG 
CAG 



TCT 
TCT 
TCT 
TCG 
TGG 
TCT 
TCA 



GG 
GG 
GG 
GG 
GG 
GC 
GG 



SEQIDNO: 
03 

04 
05 
06 



07 
08 
09 
10 
11 
12 
13 



HuJKlFOR 
HUJK2F0R 
HUJK3FOR 
HUJK4FOR 
HuJKSFOR 



ACG TTT 
ACG TTT 
ACG TTT 
ACG TTT 
ACG TTT 



GAT 
GAT 
GAT 
GAT 
ATT 



TTC 
CTC 
ATC 
CTC 
CTC 



CAC 
CAG 
CAC 
CAC 
CAG 



CTT 
CTT 
TTT 
CTT 
TCG 



GGT 
GGT 
GGT 
GGT 
TGT 



CCC 
CCC 

CCC 
CCC 
CCC 



14 
15 
16 
17 
18 



HuVKlBACK 
HUVK2BACK 
HUVK3BACK 
HUVK4BACK 
HuVKSBACK 
HUVK6BACK 



GAC 
GAT 
GAA 
GAC 
GAA 
GRA 



ATC 
GTT 
ATT 
ATC 
ACT 
ATT 



CAG 
GTG 
GTG 
GTG 

ACA 
GTG 



ATG 
ATG 
TTG 
ATG 
CTC 
CTG 



ACC 
ACT 
ACG 

ACC 
ACG 
ACT 



CAG 
CAG 
CAG 
CAG 
CAG 
CAG 



TCT 
TCT 
TCT 
TCT 
TCT 
TCT 



CC 
CC 
CC 
CC 
CC 
CC 



19 
20 
21 
22 
23 
24 



Primers fo r DNA Seq uenr.ingr 



M13 forward 
MlSreverse 



GTA AAA 
AAC AGC 



CGA CGG CCA GT 
TAT GAC CAT G 



25 
26 



VH and/or VL gene fragments were cloned into cloning vector pGEM-T easy (Promega 
Corporation) to facilitate DNA sequencing. For this 3 ng PGR product was prepared for 
restriction using QIAquick PGR purification spin columns (Qiagen, UK) according to Ae 
manufacturers instructions. DNA was eluted from the spin column in 40 jiL buffer EB. 
Purified PGR product (2 jiL) was mixed with 1 jiL pGEM-T easy vector, 6 liL water and 
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1 )iL DNA ligase and the mixture ligated for 1 h at room temperature. Ligations were then 
transformed into electroconq>etent £1 coU TGI cells (Stratagene) by electroporation, and 
plated out onto agar plates containing Ampicillin 100 ng ml"' IPTG (100 \iM) and X-gsA 
(0.006 % w/v). Colonies were allowed to grow ovemight at 37 °C and then stored at 4 °C. 
Recombinant colonies are identified as white colonies on this media. 

2. Determining the nucleotide sequence of VH and/or VL repertoires 

DNA was first prepared fix>m VH and/or VL recombinant E. coli clones. For this, 
individual colonies were each transferred to 1.2 ml LB brotii supplemented witii 
Ampicillin 100 jig ml"* using a 96 well plate format Cultures were the grown at 37 °C for 
24 hours. Bacterial cells were harvested by centrifiigation at 4,000 x g, 30 minutes and the 
supematants discarded. Plasmid DNA was prepared using Wizard SV 96 plasmid 
piuification kits (Promega Corporation) essentially following the manufacturer's 
instructions. Yields of plasmid DNA were typically in the order of 5 jig per 1 .2 ml starter 
culture. 

DNA sequencing reactions were performed using the DYEnamic ET dye terminator cycle 
sequencing kit (Amersham Pharmacia Biotech). Purified plasmid DNA (0.5 ng) was mixed 
with 8 nL DYEnamic ET terminator reagent premix and 1 ^L M13 forward or reverse 
primer (5 \M) in a total reaction volume of 20 |iL. Thermal cycling was then performed 
usmg a GeneAmp PGR system 9700 (Perkin Elmer) with the foUowmg parameters: 95 "C, 
20 seconds; 50 °C, 15 seconds; 60 "C, 1 minute; 30 cycles). Reactions were performed 
using 96 well format non-skirted ELISA plates (AB Gene). Unincorporated dye terminator 
were removed using precipitation. For this, ethanol samples were mixed with 2 p.L 7.5 M 
ammonium acetate and 55 jiL of 100 % ethanol and centrifiiged at 3000 g for 30 minutes. 
DNA pellets were washed with 70 % ethanol and re-suspended m 20 ^L loading solution. 
Reactions were sequenced using a MegaBACE 1000 DNA sequencer (Amersham 
Pharmacia Biotech) following the manufacturers instructions (2 kV injection voltage for 



-so- 



so s with electrophoresis at 6 kV for 200 minutes). Chromatograms were exported using 
the .scf file format for finishing and archiving. 

3. Determination of VH and/or VL primary amino acid sequences 

DNA sequences of VH and/or VL were determined in both forward and reverse strands 
(using M13 forward and reverse primers respectively) and compared in order to highlight 
discrepancies and maintain a high degree of accuracy. This was done using the Staden 
suite of programs (Staden, R. (1996) The Staden Sequence Analysis Package. Molecular 
Biotechnology 5, 233-241). First, sequence .scf files were entered into the PREGAP 
program where vector sequence was stripped off and the quality of the sequence was 
assessed. Poor quaUty sequences where the DNA sequence was ambiguous were rejected 
and re-sequenced. Forward and reverse strands were matched using the GAP program to 
highlight and resolve areas contaming sequencing artefacts. The orientation of the VH 
and/or VL sequence was noted and reverse complemented where necessary to produce only 
forward reading jframe orientations for translation. VH and VL gene sequences were then 
translated into amino acid sequences. Each sequence was given a unique identifier (name). 

4. Extraction of CDR regions in silico - incorporation into the database 

A general teaching of identifying CDR regions is at www.bioinforg.uk/abs/ 

The following set of rules will allow the definition of the CDRs in an antibody sequence. 
There are rare examples where these virtually constant features do not occur (for example 
the hiunan heavy chain sequence EU does not have Trp-Gly after CDR-H3). The Cys 
residues are the best-conserved feature. 



CDR-Ll 

Start 

Residue before 



Approx. residue 24 
always a Cys 



• 
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Residue after 



Length 



always a Tip, Typically Tip-iyr-Gln, but also, Tip-Leu-Gln, 
Tip-Phe-Gln, Trp-Tyr-Leu 
10 to 17 residues 



CDR-L2 
Start 

Residues before 
Length 



always 16 residues after the end of CDR-Ll 
generally He-Tyr, but also, Val-Tyr, Ile-Lys, Ile-Phe 
always 7 residues 



10 CDR-L3 
Start 

Residue before 
Residues after 
Length 

15 

CDR-Hl 
Start 

Residues before 
Residues after 
20 Length 



always 33 residues after end of CDR-L2 
always Cys 

always Phe-Gly-Xaa-Gly (SEQ ID NO: 92) 
7 to 11 residues 



Approx. residue 26 (always 4 after a Cys) 

always Cys-Xaa-Xaa-Xaa (SEQ ID NO: 93) 

always a Trp. Typically Tip-Val, but also, Trp-Ile, Trp-Ala 

10 to 12 residues 



25 



CDR-H2 

Start 

Residues before 

Residues after 
Length 



always 15 residues after the end of CDR-Hl 

typically Leu-Glu-Tip-Ile-Gly (SEQ ID NO: 94), but a number of 

variations 

Lys/Arg-Leu/IleA^al/Phe/Thr/Ala-Thr/Ser/Ile/Ala 
16 to 19 residues; 
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CDR-H3 



Residues before 



Start 



always 33 residues after end of CDR-H2 (always 2 after a Cys) 
always Cys-Xaa-Xaa (typically Cys-Ala-Arg) 



Residues after 



always Trp-Gly-Xaa-Gly (SEQ ID NO: 95) 
3 to 25 residues 



5 Length 



5. Detection of dominant CDR & framework regions in VH and/or VL repertoire 

The database was constructed using SQL Server Database software (Microsoft). This 
allowed the database to be queried using SQL and allowed CDRl, CDR2 and CDR3 

10 sequences to be extracted from any range of database VH and/or VL sequences in FASTA 
format. Extracted CDRs were then subject to multiple alignment using CLUSTAL X in 
such a way so that identical or very similar CDRs are grouped together in blocks 
(Thompson, J.D,, Higgins, D.G. and Gibson, TJ. (1994) CLUSTAL W: improving the 
sensitivity of progressive multiple sequence alignment through sequence weighting, 

15 positions-specific gap penalties and weight matrix choice. Nucleic Acids Research, 
22:4673 -4680). From this flie firequency of recurring CDRs was determined and CDR- 
dominance in the immunoglobulin repertoire was established. Altematively, a graphical 
interface was employed using Probiosys (proBionic EMEA, Amsterdam). ProBiosys 
prepares dendrograms from CLUSTAL .phy files and allows a rapid visual interpretation 

20 of the relationships among large numbers of aligned sequences. The whole process was 
performed for CDRs from the same patient sampled at different points during an illness, 
for CDRs from patients with different infections, or from sex-matched, age-matched or 
race-matched patient groups, depending on the search and the additional information added 
to the database entries. 



25 
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6. Construction & production of therapeutic recombinant antibodies from dominant 
antibody sequences 

Once dominant antibody sequences have been recognised in a given repertoire, the 
information can be used to infer the presence of CDRs and frameworks that confer 
5 immunity. Selected VH and/or VL sequences can be identified and their gene sequences 
resynthesised using synthetic oligonucleotides. This is important as it also allows the 
human-derived gene sequence to be codon-optinndsed for E. coli in order to improve 
protein expression. Dominant VH and/or VL sequences can be spliced together using a 
spacer (linker) sequence (Figure 2). This gene cassette, termed a scFv, can be resynthesised 

10 to include terminal Ndel and Notl restriction sites for cloning into expression vector 
pET29b (Novagen). ScFv DNA can be cut with Ndel (cuts in VH) or Notl (cuts m VL) 
using the following reaction: 40 ^il DNA (4 ^ig), 10 p.1 of Restriction enzyme buffer D 
(Promega), 47 ^il water and 2 ^l Ndel and Notl enzyme (10 u Promega) with 
digestion for 4 h at 37 ""C, DNA can be then fractionated on 0,7% agarose TAB gels and 

15 the digested DNA excised from the gel and purified from the agarose slice using the 
Geneclean n kit (Bio 101) exactly according to the manufacturers instruction. pET29b 
vector DNA can be cut with Ndel and Notl as described above. 1 ^g vector can be mixed 
with 1 ng restricted VL DNA resuspended in 8.5 |li1 water and ligated by addition of 1 jil 
10 X ligation buffer (Boehringer Mannheim) and 0.5 ^il DNA ligase (3 u )iV ; Boehringer 

20 Mannheim), followed by ligation overnight at 14 ''C. 

The entire ligation can be transformed into E. coli TG2 cells (Stratagene, UK) by 
electroporation. Transformants can be plated out onto agar plates containing Tetracycline 
(25 |xg ml"^ - selective for recA ::TnlO on the chromosome of TG2) and Kanamycin (50 
25 Jig ml"^ - selective for pET29). Recombinant plasmid DNA prepared from individual 
clones can be checked for scFv sequence by digestion with Ndel and Notl as described 
above. 
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Recombinant scFvs can be over-expressed in E. coli strain JMl 09(DE3) and purified thus 
enabling biochemical and biological characterisation. Recombinants can be spread on LB 
plates supplemented with 50 nil'' kanamycin and a single colony can be used to 
inoculate a 10 microlitre starter culture of LB broth with 50 jig ml ' kanamycin. For this, 
5 a 1 L e3q)ression culture can be prepared in the presence of kanamycin for 4-5 h at 37 "C 
with shaking at 300 ipm. At ODgoo =1, induction can be performed using IPTG and the 
cells can be grown with vigorous aeration for a further 3-5 h. Cells can then be harvested 
by centrifugation (4000 x g, 15 minutes, 4 °C). The cell pellet can be resuspended in 10 
ml fresh (4 °C) lysis buffer (below) and stored at -20 ° C overnight prior to cell breakage 
10 using the 25 ml X-Press ^stem (AB Biox) 

LYSIS BUFFER 

50mMTrisHClpH8.0 
1 mM EDTA 
15 lOOmMKCl 

0.1 mM AEBSF (amino ethyl benzene sulphonic acid) 

The lysate can be centrifuged in an Oakridge tube (24 300 x g, 4 °C, 30 minutes). The 
pellet can be resuspended in 20 ml of ice cold lysis buffer on ice using a Silverson lab 

20 blender. The lysate can be split int» 8 x Oakridge tubes and each aliquot can be diluted to 
30 ml (total 120 ml) in ice-cold Lysis Buffer. Inclusion bodies can be pelleted (24 300 x 
g, 4 "C, 30 minutes) and the supernatant discarded. Each of the 8 pellets can be 
resuspended in 30 ml of lysis buffer, and the inclusion bodies can be harvested by 
centrifugation again (24 300 x g, 4 °C, 30 minutes). This wash/centrifugation step can be 

25 performed five times in total to clean the inclusion body fraction. Each pellet can be 
resuspended in 15 ml of ice-cold water and the inclusion bodies can be stored at -20 "C. 



p 
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For refolding, 200 ml solution of 2% (w/v) N-lauryl sarcosine (NLS) in 50 mM Tris HCl 
pH 9.0 can be prepared as follows. 4 g NLS can be solubilised in 100 ml water with 
stirring. 1 0 ml of IM Tris pH 9.0 stock solution can be added and made up to 1 95 ml with 
water. 5 ml of inclusion body slurry can be added and stirred vigorously for 30 minutes at 
5 room temperature* 

CuClj can be added to a concentration of 100 ^iM. This serves as a catalyst for oxidation. 
The refolding reaction can be transferred to 4 ''C and stirred vigorously for 2 days to 
promote aeration. The refolding reaction can be vacuimi filtered through a 0.44 jiM 

10 vacucup bottle top filter unit (90 mm diameter, Gellman Sciences) and tiie filtrate 
transferred to a Pellicon Labscale TFF system fitted with PLGCIO membrane unit 
(Millipore). The reaction can be concentrated to 25 ml using tangential flow, discarding 
the permeate (the scFv is localised to the retentate). The solution can be diafiltered against 
40 X tum-over volxmies (1 L) of 10 mM ammonium acetate (AAT) pH 9.0. Finally, the 

15 volume of the antibody can be diluted to 50 ml using 10 mM AAT pH 9.0. The buffer 
exchanged antibody can be stored for 2 hours at 4 ""C. The typical protein content was 1 - 
2 mg ml"^ with a jdeld of up to 50 mg per litre. 

For scFv purification a 10 ml bed volume of Ni NTA superflow agarose Qiagen can be 
20 prepared according to the manufacturers instructions using a 10 ml glass column (Sigma) 
without flow adaptors. The column can be equilibrated with 50 ml Buffer B (6M urea, 0. 1 
M NaH2P04, 10 mM Tris HCl pH 8.0). Refolded scFv (as described above)can be diluted 
1/5 in Buffer B and applied directly to the column at ml min'^ . 50 ml buffer B can be 
appHed to the column (flow rate 5 ml min '0 followed by 50 ml Buffer C (same 
25 composition as buffer B but pH 6.3). The purified scFv can be eluted from the column by 
slowly applying Buffer B supplemented with 250 mM high grade imida2X)le until the A^go 
< 0.05. 
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Results 

The variable heavy chain (VH) and variable light chains of the antibody genes from 
peripheral blood lymphocytes (PBLs) isolated from C. difficile infected patients have been 
TA cloned and sequenced. In each case the third complementarity determining region (i.e. 
CDR-H3 or CDR-L3) was identified. This region is both the most variable region of the 
VH chain and the area identified as being most important in determining antigen binding, 
for tihis reason this area is used as the signature for each VH chain. 

Clinical Information 

Patient DOl - Variable heavy chains (VH*s) and variable light chains (VL*s) were 
sequenced from PBLs taken ten days after the onset of CDAD. Lab results confirmed C. 
difficile toxin was present 

Patient D02 - VH's and VL's were sequenced from PBLs taken ten days after the onset of 
CDAD. Lab results confirmed C. difficile toxin was present In the seven subsequent 
weeks after sample was taken, the patient did not develop any fiirther C. difficile 
infections, suggesting that the antibodies produced in response to the infection were 
protective. 

Patient D03 - VH's and VL's were sequenced from PBLs taken ten days after the onset of 
CDAD. Lab results confirmed C. difficile toxin was present In &e three subsequent weeks 
after sample was taken, the patient did not develop any further C. difficile infections, 
suggesting that the antibodies produced in response to the infection were protective. 

Sequencing results 

Patient DOl showed a high degree of focusing with both VH's and VL's. Three different 
CDR3 sequences make-up over 80 % of the sequenced VH's. VH's with the CDR3 
sequence of SEQ ID NO: 27 account for 57.8 % of sequenced VH's from this patient 
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VH's with the CDR3 sequence of SEQ ID NO: 28 account for 13,5 % of all VH's, and 
VH's the CDR3 sequence of SEQ ID NO: 29 account for 10 % of all sequenced VH's in 
this patient. Three different CDR3 sequences also make up the majority of sequenced 
VL's, these are the CDR3 sequences of SEQ ID NOs: 32, 34, and 33 (occurring at 
5 frequencies of 32.7 %, 15.3%, and 6.6%, respectively). 

Similarity searches were performed using these common VH and/or VL sequences against 
other VH and/or VL sequences contained within a database (FABTEC database), which 
contains VH and/or VL regions of PBLs isolated from patients with methicillin-resistant 

10 Staphylococcus aureus (MRSA), Candida albicans^ Pseudomonas aeruginosa (the 
causative agent of cystic fibrosis). Streptococcus oralis and vancomycin-resistant 
Enterococci (VRE) infections and to idrartify antibody sequences which are associated with 
resistance to the infections and which can therefore be used to effect therapy against those 
infections. These sequences have formed the basis of WO 01/76627. For all of these, the 

IS isolated VH and/or VL sequences demonstrate skewing of the antibody repertoire over the 
course of infection, thus revealing the identity of matured, immunity-conferring VH and/or 
VL sequences. This has typically been done using 5,000 to 15,000 sequences for each VH 
and/or VL. VH and/or VL repertoires from healthy individuals can also be used for 
comparison purposes. 

20 

Using the BLOSUM50 matrix it was possible to pick out groups of antibodies with 
CDR3's that had greater than 70% similarity. The CDR3 sequences of SEQ ID NOs: 27 
and 28 (Table 2 and 3, respectively), showed sequence similarity of greater than 70% only 
with other VH's from C difficile infected patients VH's sequenced from individuals with 
25 other infections (C albicans^ MRSA and VRE) showed no similar CDR3 sequences. 

In the following Tables, libraries starting D are derived from C difficile infected patients, 
libraries starting M are derived from MRSA infected patients, libraries starting V are 
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derived &om VRE infected patients; and libraries starting C are derived from C. albicans 
infected patients. 



Table 2 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


27 


100.0 


DOl 


184 


27 


100.0 


D03 


5 


35 


100.0 


DOl 


1 


36 


100.0 


DOl 


3 


37 


100.0 


DOl 


1 


37 


100.0 


D03 


1 


38 


93.75 


DOl 


1 



Tables 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


28 


100.0 


DOl 


43 


28 


100.0 


D03 


1 


39 


100.0 


DOl 


1 


40 


88.88 


DOl 


1 



VH's with the CDR3 sequence of SEQ ID NO: 29 showed similarity to VH's from 
other C. difficile infected patients and MRSA infected patients (Table 4). 




Table 4 



CDR3SEQ1DNO: 


Similarity 


Library 


No. of clones 


41 


100.0 


DOl 


1 


29 


100.0 


DOl 


32 


29 


100.0 


MOl 


1 


29 


100.0 


M02 


5 


29 


100.0 


M03 


2 


29 


100.0 


M04 


8 


42 


100.0 


M03 




43 


100.0 


DOl 




44 


93.75 


DOl 




45 


93.75 


DOl 




46 


75.0 


M04 




47 


70.58 


M05 





The CDR3 sequences from the VL*s of patient DOl have also had a similarity search 
performed on them. VL's with the CDR3 sequences of SEQ ID NOs: 32 and 33 (Tables 
5 and 6, respectively) show a high degree of homology with CDR3 sequences from C. 
20 difficile infected patients but no homology to CDR3 sequences derived from patients with 
different clinical mfections. VL's with the CDR3 sequence of SEQ ID NO: 34 showed 
similarity to VL's isolated from C. difficile, MRSA and VKE infected patients (Table 7). 
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Table 5 



CDR3SEQIDNO: 


Similarily 


Library 


No. of clones 


32 


100.0 


D51 


64 


48 


90.9 


D52 


2 


49 


90.9 


D52 




50 


90.9 


D52 




51 


90.0 


D51 




52 


90.0 


D52 




53 


81.8 


D52 




54 


81.8 


D52 




55 


81.8 


D52 




56 


81.8 


D52 




57 


80.0 


D52 




58 


72.7 


D52 




Table 6 


CDR3SEQIDN0: 


Similarity 


Library 


No. of clones 


33 


100.0 


D51 


13 


59 


90.9 


D52 


1 
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Table 6 continued 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


60 


83.3 


D52 


1 


61 


81.8 


D52 


4 


62 


81.8 


D52 


1 


63 


81.8 


D52 




64 


81.8 


D52 




65 


81.8 


D51 




66 


81.8 


D52 




67 


81.8 


D52 




68 


72,7 


D52 




69 


72.7 


D52 





Table? 



CDR3SEQIDN0: 


Similarity 


Library 


No. of clones 


34 


100.0 


D51 


30 


34 


100.0 


M56 


2 


34 


100.0 


V54 


2 


70 


88.9 


D51 


1 


71 


88.9 


D51 


1 


72 


88.9 


V54 


2 


73 


77.9 


M55 


1 


74 


77.9 


D51 


1 


75 


77.9 


M55 


1 


76 


77.9 


M55 


1 
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Patient D02 also showed a high degree of focusing of the VH rq)ertoire, although focusing 
is less obvious in the VL repertoire. Two types predominate in the VH library of this 
patient. VH's with the CDR3 sequence of SEQ ID NO: 30 accounted for 45.6% of 
sequenced clones and those with the CDR3 sequence of SEQ ID NO: 31 accounted for 
8.8% of the library. In the VL library the three most frequently isolated clones with the 
CDR3 sequences of SEQ ID NOs: 61, 96, and 97 were found at 4%, 4% and 3%, 
resfpectively. 

Similarity searches in the FABTEC database showed the two common VH CDR3 
sequences to be specific to C. difficile patients (Tables 8 and 9). VH's with the CDR3 
sequence of SEQ ID NO: 30 match with other antibodies in that library and antibodies 
from Uhraiy D03 (Table 8). VH's with the CDR3 sequence of SEQ ID NO: 3 1 match with 
antibodies from library D03 (Table 9). 



Table 8 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


77 


100.0 


D02 




78 


100.0 


D02 




30 


100.0 


D02 


52 


30 


100.0 


D03 




79 


100.0 


D02 




79 


100.0 


D03 




80 


94.4 


D02 




81 


94.4 


D02 




82 


94.4 


D02 




83 


88.8 


D02 




84 


77.7 


D03 
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Table9 



CDR3 SEQIDNO: 


Similarity 


Library 


No. of clones 


31 


100.0 


D02 


10 


31 


100.0 


D03 


4 


85 


90.0 


DOS 


1 


86 


90.0 


D03 


1 



Similarily searches using some of the VL's showed a clustering of sequences. 15% of VL's 
from patient D02 (library D52) showed high degree of similarity to the CDR3 sequ^ce 
of SEQ ID NO: 48, VL*s from patient DOl (library D51) also showed a high degree of 
homology to this sequence (Table 10). In a similar way a cluster aroimd the sequence of 
SEQ ID NO: 61 accounts for 13% of patient D02 sequenced VL's, again clones similar to 
this one are also found in patient DOl (Table 11). Neither of these clones show any 
significant similarity to VL CDR3 sequences isolated from patients infected wifli other 
pathogens. It should also be note that using the BLOSUM50 matrix these two clones show 
greater than 60% similarity to each other. 



Table 10 



CDR3SEQIDNO: 


Similarity 


Library 


No. of clones 


48 


100.0 


D52 


2 


49 


100.0 


D52 


1 


53 


100.0 


D52 


1 


32 


90.9 


D51 


64 


87 


81.8 


D52 


2 


51 


81.8 


D51 


1 


88 


81.8 


D52 


1 
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Table 10 continued 



CDR3SEQIDNO: 


Similarity 


Library 


No. of clones 


57 


72.7 


D52 


1 


89 


72.7 


D52 


1 


50 


11.1 


D52 


1 


54 


12,1 


D52 


1 


55 


72.7 


D52 


2 



Table 11 



CDR3SEQIDN0; 


Similarity 


Library 


No. of clones 


61 


100.0 


D52 


4 


65 


100.0 


D51 




59 


81.8 


D52 




33 


81.8 


D51 


13 


62 


81.8 


D52 




63 


81.8 


D52 




60 


75.0 


D52 




64 


72.7 


D52 




66 


72.7 


D52 




68 


72.7 


D52 




67 


72.7 


D52 





Patient DOS has only preliminary results for VH's but shows many sunilar sequences to 
both DOl and D02 so are worth reporting. Only data from 49 VH sequences have been 
produced. 10% of the clones have the CDR3 sequence of SEQ ID NO: 27, this is the 
sequence of the most common VH in patient DOl. 8% of VH's have the CDR3 sequence 
of SEQ ID NO: 3 1 - this is the second most common VH CDR3 sequence in patient D02. 
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Cross-Libraiy comparison (Table 12) of the CDR3 sequences of VH's has shown 8 CDR3 
sequences which appear in more than one of the C. difficile infected patients. The CDR3 
sequences of SEQ ID NOs: 27, 28, and 37 appear m patients DOl and D03 and the 
sequencesof SEQIDNOs: 31, 90, 91, 79, and 30 all appear in patients D02 and D03. 
So far patients DOl and D02 show no common CDR3 sequences. VL's have only been 
sequenced firom patients DOl and D02, these patients share no identical CDR3 sequences. 



Table 12 



VHCDR3SEQIDNO: 


Patient 


27 


DOl and D03 


28 


DOl andD03 


37 


DOl andD03 


31 


D02andD03 


90 


D02andD03 


91 


D02andD03 


79 


D02andD03 


30 . 


D02andD03 



Summary 

The presence of a large number of a particular VH CDR3 sequence in a patient with a 
C difficile infection indicates that VH as part of an scFv may be protective against the 
organism. CDR3 sequences that are shared by more than one patient indicate specific 
C. difficile sequences. It should also be noted that sequences with a high degree of 
homology often due to somatic mutation within the patient may also represent 
important sequences with similar but subtly different properties. Analysis of the CDR3 
sequences of the VH firom our individuals has lead to the identification of 17 
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potentiaUy protective VH's. These VH CDR3 sequences correspond to SEQ ID NOs: 
30, 79, 28, 91, 41, 39, 38, 27, 36, 36, 40, 37, 90, 29, 44, 45, and 43. 

A similar analysis of the VL has identified 44 different VL CDR3 sequences. The larger 
5 number is due to the increased rate of somatic mutation in antibody light chains leading 
to VL's with slightly different sequences. These VL CaDR3 sequences correspond to SEQ 
ID NOs: 59, 60, 61, 33, 62, 63, 64, 65, 66, 77, 84, 78, 30, 79, 80, 81, 82, 87, 57, 74, 85, 31, 
86, 82, 51, 48, 88, 49, 53, 32, 52, 70, 71, 34, 89, 68, 67, 50, 54, 55, 56, 59, 58, 96, and 97. 



10 
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1 . An antibody or an antigen binding fragment thereof having the CDR-H3 sequence 
selected from the group consistmg of: SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 
29, SEQ ID NO: 30, and SEQ ID NO: 31. 

2. An antibody or an antigen binding fragment thereof having the CDR-L3 sequence 
selected from tiie group consisting of: SEQ ID NO: 32, SEQ ID NO: 33, and SEQ ID NO: 
34. 

3. An antibody or an antigen binding fragment thereof having a CDR-H3 sequence 
selected from the group consisting of: SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, 
SEQ ID NO: 30, and SEQ ID NO: 31, and a CDR-L3 sequence selected from the group 
consisting of: SEQ ID NO: 32, SEQ ID NO: 33, and SEQ ID NO: 34. 

4. A method for identifying candidate sequences of at least the CDR3 region of 
antibodies specific against at least one antigen produced by Clostridium difficile during an 
infection or against a vaccine, comprising the steps of: 

(i) with B cells isolated from at least one patient who has been infected by 
Clostridium difficile or administered said vaccine, sequencing at least tiie 
CDR3 region of the VH and/or VL coding regions of said B cells; and 

(ii) correlating said sequenced at least the CDR3 regions of the VH and/or VL 
coding regions of said B cells from said at least one patient to identify a set 
of candidate sequences for at least a CDR3 region of antibodies specific 
against said at least one antigen produced by Clostridium difficile or against 
said vaccine, each of said set of candidate CDR3 sequences or a sequence 
having at least 80% homology therewith occurring in total at a frequency of 
at least 1 percent in the set of sequences determined at step (i). 
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5. A method according to claim 4, said B cells being selected from tiie group 
consisting of peripheral B-cell lymphocytes and B cells from the spleen. 

6. A method according to claim 5, said peripheral B-cell lymphocytes being isolated 
from blood from said at least one patient 

7. A method according to any of claims 4-6, said at least one antigen being an 
immunogen. 

8. A method according to any of claims 4-7, said at least one patient displaying a 
pronounced antibody response in response to infection by Clostridium difficile. 

9. A method according to any of claims 4-8, said at least one patient having recovered 
from infection by Clostridium difficile. 

10. A method according to any of claims 4-9, said correlation step (ii) comprising 
determining putative amino acid sequences fix)m said sequenced at least fbs VH and/or VL 
CDR3 coding regions, and correlating said putative amino acid sequences. 

11. A method according to claim 9, said correlation step (ii) comprising identifying the 
Complementarity Determining Regions comprised in said at least the VH and/or VL 
regions and correlating said Complementarity Determining Regions. 

12. A method according to claim 1 1, said Complementarity Determining Regions being 
selected from the group consisting of CDRl, CDR2 and CDR3. 

13. A method according to claim 4-12, said correlation step (ii) additionally correlating 
at least one of the group consisting of: the strain of Clostridium difficile infecting said at 
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least one patient, the time point at which said B cells are isolated during infection of said 
at least one patient by Clostridium difficile, the age of said at least one patient, the sex of 
said at least one patient, and the race of said at least one patient 

14. A method according to claim 4-13, said B cells having been isolated from said at 
least one patient at a plurality of time points during infection of said at least one patient by 
Clostridium difficile, said correlation step (ii) correlating the time point during faifection 
of said at least one patient by Clostridium difficile at which said B cells are isolated. 

15. A method according to claim 4-13, said B cells having been isolated from at least 
two patients, at least one of whom has recovered from infection by Clostridium difficile, 
and at least one of whom has not recovered from infection by Clostridium difficile, said 
correlation step (ii) correlating the recovery of said at least two patients fixMn infection by 
Clostridium difficile. 

16. A method according to claim 4-13, said B cells having been isolated from at least 
two patients, said patients being infected by different strains of Clostridium difficile 
producing said at least one antigen, said correlation step (ii) correlating said sequenced at 
least the VH and/or VL coding regions of said B cells to identify a set of candidate 
sequences for antibodies, each of which is specific against at least one shared antigen 
produced by said different strains of Clostridium difficile or is specific against different 
antigens produced by said different strains of Clostridium difficile. 

17. A method of manufecture of a medicament for the treatment of an infection by 
Clostridium difficile which produces at least one antigen, comprising the steps of: 

(i) performing a method according to any one of the preceding claims; and 

(ii) synthesising at least one antibody comprising at least one candidate sequence 
specific against said at least one antigen produced by Clostridium difficile. 
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ls. A metibod of manufacture of a medicament according to claim 17, comprising the 
step of combining said synthesised at least one antibody witii a pharmaceutically 
acceptable carrier, diluent or excipient 

19. A method of treatment of an infection of a patient by Clostridium difficile which 
produces at least one antigen, comprising the steps of: 

(i) performing a method according to any one of claims 4-16; 

(ii) synthesising at least one antibody comprising at least one candidate sequence 
specific against said at least one antigen produced by Clostridium difficile; and 

(iii) administering a therapeutically effective quantity of said at least one 
synthesised antibody to said patient 

20. A method of producing a database which identifies candidate sequences for 
antibodies specific against at least one antigen produced by Clostridium difficile, 
comprising the steps of: 

(i) performing a method according to any one of claims 4-16; and 

(ii) storing the data produced by said method in said database. 

21. A method of generating a report which identifies candidate sequences for antibodies 
specific against at least one antigen produced by Clostridium difficile, comprising the steps 
of: 

(i) performing a method according to any one of claims 4-16; and 

(ii) producing a report comprising the data produced by said method. 

22. A method for determining the efficacy of a vaccine, comprising the steps of: 

(i) with B cells isolated fi:om at least one patient who has been administered 
said vaccine, sequencing at least the CDR3 region of the VH and/or VL 
coding regions of said B cells; and 
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(ii) correlating said sequenced at least the CDR3 region of the VH and/or VL 
coding regions of said B cells to identify a set of candidate sequences for at 
least the CDR3 region of antibodies specific against said vaccine, each of 
said set of CDR3 candidate sequences or a sequence having at least 80% 
homology therewith occurring in total at a frequency of at least 1 percent in 
the set of sequences determined at step (i). 

23. A method according to claim 20, correlation step (ii) comprising correlating said 
sequenced at least a CDR3 region of the VH and/or VL coding regions of said B cells with 
sequenced at least a CDR3 region of the VH and/or VL coding regions of B cells isolated 
from at least one patient who has been infected with Clostridium difficile against which 
vaccination with said vaccine is intended to stimulate a protective immune response. 

24. A method according to claim 22, said correlation step (ii) additionally correlating 
at least one of the group consisting of: the time since administration of said vaccine to said 
at least one patient, the age of said at least one patient, the sex of said at least one patient, 
the race of said at least one patient, and the Complementarity Determining Regions of said 
sequenced at least the VH and/or VL coding regions. 

25. A method according to claim 22, said method determining tihe efiScacy of said 
vaccine in stimulating a protective immime response against Clostridium difficile against 
which vaccination with said vaccine is intended to stimulate a protective immune response. 

26. A method of producing a database which identifies the efficacy of a vaccine, 
comprising the steps of: 

(i) performing a method according to iany one of claims 22-25; and 

(ii) storing the data produced by said method in said database. 
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27- A method of generating a report which identifies tiie efficacy of a vaccine, 
comprising the steps of: 

(i) performing a method according to any one of claims 22-25; and 

(ii) producing a report comprising the data produced by said method, 

28. A diagnostic test method for identifying a Clostridium difficile infection in a patient, 
comprising the steps of: 

(i) with B cells isolated firom said patient, sequencing at least the CDR3 region 
of the VH and/or VL coding regions of said B cells; 

(ii) comparing said sequenced at least said CDR3 region of the VH and/or VL 
coding regions of said B cells with a set of sequences for at least the CDR3 
region of antibodies specific against Clostridium difficile^ and determining 
whether each of said set of CDR3 sequences or a sequence having at least 
80% homology therewith occurs in total at a frequency of at least 1 percent 
in the set of sequences determined at step (i); and 

(iii) correlating the results of comparison step (ii) to determine ttie presence or 
absence of a Clostridium difficile infection in said patient 

29. A diagnostic test method for determining the susceptibility of a patient to 
Clostridium difficile infection, comprising the steps of: 

(i) with B cells isolated fi-om said patient, sequencing at least the CDR3 region 
of the VH and/or VL coding regions of said B cells; 

(ii) comparing said sequenced at least said CDR3 region of the VH and/or VL 
coding regions of said B cells with a set of sequences for at least the CDR3 
region of antibodies specific against Clostridium difficile^ and determining 
whether each of said set of CDR3 sequences or a sequence having at least 
80% homology therewith occurs in total at a firequency of at least 1 percent 
in the set of sequences determined at step (i); and 
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(iii) correlating the results of comparison step (ii) to determine the susceptibility 
of said patient to Clostridium difficile infection. 

30. A diagnostic test kit for performing a diagnostic test method according to any one 
5 of claims 28-29. 



-54- 



ABSTRACT 



Clostridium difficile focussed antibodies 



The present invention is concerned with antibodies specific for and which confer immunity 
against infection by C difficile or against a vaccine, methods of identifying same, methods 
of manufacture of medicaments, and methods of treatment of patients using same. Also 
provided is a method for determining the efficacy of a vaccine, together with methods of 
vaccinating a patient, diagnostic test methods and diagnostic test kits. 
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